Human Evaluation of Kea, an Automatic Keyphrasing System

ثبت نشده
چکیده

This paper describes an evaluation of the Kea automatic keyphrase extraction algorithm. Tools that automatically identify keyphrases are desirable because document keyphrases have numerous applications in digital library systems, but are costly and time consuming to manually assign. Keyphrase extraction algorithms are usually evaluated by comparison to author-specified keywords, but this methodology has several well-known shortcomings. The results presented in this paper are based on subjective evaluations of the quality and appropriateness of keyphrases by human assessors, and make a number of contributions. First, they validate previous evaluations of Kea that rely on author keywords. Second, they show Kea's performance is comparable to that of similar systems that have been evaluated by human assessors. Finally, they justify the use of author keyphrases as a performance metric by showing that authors generally choose good keywords.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Evaluation of Kea: An Automatic Keyphrase Extraction Algorithm

Keyphrases, often defined as keywords, are an important means of document summarization, searching, browsing, and clustering. This paper describes and evaluates Kea, an algorithm for automatically extracting keyphrases from text. Kea identifies candidate keyphrases using lexical methods, calculates TFIDF feature values for each candidate, and uses naïve Bayes learning scheme to predict keyphras...

متن کامل

UK Data Archive Keyword Indexing with a SKOS Version of HASSET Thesaurus

Apply automatic indexing tool, KEA, to some of the UK Data Archive’s document collection using HASSET thesaurus with aims to: • see whether KEA could potentially be used to aid metadata creation. • develop recommendation for the future use of automatic indexing with an existing thesaurus. Human Evaluation: Manually compare auto-keywords with manually assigned keywords. • strictly relevant: ‘exa...

متن کامل

A Refined Methodology for Automatic Keyphrase Assignment to Digital Documents

AbstrAct: Keyphrases precisely express the primary topics and themes of documents and are valuable for cataloging and classification. Manually assigning keyphrases to existing documents is a tedious task; therefore, automatic keyphrase generation has been extensively used to classify digital documents. Existing automatic keyphrase generation algorithms are limited in assigning semantically rele...

متن کامل

Evaluation of the Parameters Involved in the Iris Recognition System

Biometric recognition is an automatic identification method which is based on unique features or characteristics possessed by human beings and Iris recognition has proved itself as one of the most reliable biometric methods available owing to the accuracy provided by its unique epigenetic patterns. The main steps in any iris recognition system are image acquisition, iris segmentation, iris norm...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001